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ABSTRACT 


Optimal  missile  avoidance  is  analyzed  with  a  two-dimensional 
linearized  kinematic  model.  It  is  shown  that  inclusion  of  a  control- 
effort  penalization  term  in  the  payoff  function  leads  to  extend  the 
domain  of  validity  of  the  trajectory  linearization.  The  required 
values  of  the  weighting  coefficient  of  the  control  penalization  and 
the  resulting  loss  in  the  optimal  miss  distance  are  evaluated.  A 
recursive  algorithm  for  the  numerical  solution  of  the  modified 
optimal  avoidance  problem  is  presented.  The  optimal  solutions 
obtained  for  the  linearized  kinematical  model  are  compared  to  results 
of  non-linear  simulation. 
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I .  INTRODUCTION 

The  problem  of  optimal  missile  avoidance  was  analysed  in  the  past 
using  different  types  of  simplified  mathematical  models.  Three  types  of 
simplifying  assumptions  were  used: 

a.  Neglecting  guidance  dynamics.1-3 

b.  Restricting  the  motion  to  a  plane.3-10 

c.  Trajectory  linearization.1-3’7’9-12 

It  was  shown  that  the  attractive  assumption,  made  by  neglecting  the 
dynamics  of  the  pursuer,  yields  seriously  misleading  results:’2’3  When¬ 
ever  guidance  dynamics  is  considered4  12  (even  if  by  an  approximation 
of  a  first  order  time  constant  or  a  pure  time  delay) ,  optimal  evasion 
can  guarantee  non-zero  miss  distance  even  from  a  pursuer  of  unlimited 
maneuverability7  or  from  one  of  an  optimal  guidance  strategy.12 

Trajectory  linearization,  both  for  two  and  three  dimensional 
models  has  been  proved  to  be  a  useful  way  to  obtain  analytical  solutions 
providing  an  insight  to  the  problem.  Recently  accomplished  studies 
with  a  linearized  kinematic  model10’  11  indicate  that  the  optimal  maneuver 
for  missile  avoidance  is  a  "bang-bang"  type  with  the  continuous  use  of 
maximum  load  factor  of  the  evading  airplane.  It  can  be  therefore  reduced 
to  an  optimal  roll -position  control  problem  of  two  consecutive  phases: 

(1)  Orienting  the  airplane  lateral  acceleration  vector  into  the  plane 
of  optimal  evasion;  (2)  Changing  the  direction  of  this  acceleration, 
which  has  to  be  maximal,  by  rapid  roll  maneuvers  of  1806  in  accordance 
with  an  optimal  switch  function. 


However,  the  validity  of  trajectory  linearization  is  not  always 
obvious.  This  assumption  is  valid  as  long  as  the  evader's  trajectory 
does  not  deviate  much  from  its  initial  direction.  This  requirement 
can  be  satisfied  if: 

a.  The  dynamic  similarity  parameter  of  the  problem,13  defined  by 
direction  change  of  the  evader  during  a  period  of  the  pursuer's  time 
constant,  is  small. 

b.  The  solution  does  not  include  excessively  long  turns  in  one  direction. 

The  first  condition  has  to  be  examined  before  trajectory  linearization. 
The  second  one,  however,  can  be  verified  only  a--posteriori .  Due  to  the 
"bang-bang"  structure  of  the  optimal  evasive  maneuver,  in  most  cases  this 
second  condition  is  also  satisfied.  A  recent  investigation 14  has  shown 
that  there  exists  a  range  of  parameters  (long  flight  times,  small  values 
of  effective  proportional  navigation  constants,  low  missi le/target 
maneuver  ratios)  for  which  long  turns  are  predicted  by  the  linearized 
kinematic  model  maximizing  the  miss  distance.  Moreover,  it  has  been 
shown10  that  the  sensitivity  of  the  miss  distance  to  target  maneuver, 
performed  far  away  from  the  point  of  closest  approach  is  relatively  small. 

The  objective  of  the  present  paper  is  to  modify  slightly  the 
optimal  control  problem,  to  enable  the  extension  of  the  validity  of 
trajectory  linearization. 

In  this  new  formulation  the  payoff  to  be  maximized  is  the  square 
of  the  miss  distance  penalized  by  a  quadratic  integral  term  of  the 


control  effort.  A  similar  cost  function  has  been  used  in  the  past8  to 
avoid  numerical  difficulties  in  singular  control.  The  optimal  miss 
distance  obtained  with  this  new  formulation  will  be,  no  doubt,  smaller 
than  the  value  predicted  in  the  original  problem. 

Recently,  it  has  been  proven15  that  the  difference  between  the 
miss  distance  obtained  using  different  cost  functions  can  be  made  small  by 
proper  choice  of  the  weighting  coefficient  of  the  integral  term. 

In  the  present  work  the  emphasis  is  to  eliminate  the  long  initial 
turn  of  the  optimal  maneuver  sequence,  which  can  be  achieved  by  using  a 
relatively  large  weighting  coefficient  for  the  control  penalization. 

The  analysis  is  carried  out  under  the  following  set  of  assumptions: 

1.  Missile  and  target  are  considered  as  constant  speed  point-mass 
elements. 

2.  The  missile  is  guided  by  proportional  navigation  with  constant 
effective  P.N.  coefficient. 

3.  Both  missile  and  . jet  perform  lateral  accelerations  perpendicular  to 
the  initial  line  of  sight. 

4.  The  deviation  of  the  trajectory  from  the  reference  line  of  sight  can 

be  decomposed  in  two  perpendicular  planes.  For  the  sake  of  simplicity 
only  one  of  these  planes  is  considered  and  the  gravity  component 
in  this  plane  is  neglected. 

5.  The  dynamic  response  of  the  guidance  system  is  approximated  by  a 

first  order  transfer  function  with  a  time  constant  t. 
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6.  The  missile  has  unbounded  lateral  acceleration. 

7.  Target  dynamics  are  neglected  in  first  approximation  but  target 
lateral  acceleration  is  bounded. 

Based  on  these  assumptions  the  modified  optimal  missile  avoidance 
problem  is  formulated  in  Section  II  in  a  nondimen sional  form,  and 
solved  in  a  closed  form  (up  to  a  multiplicative  constant)  in  Section 
III.  The  criteria  for  the  selection  of  the  proper  weighting  of  control 
effort  penalization  term  is  outlined  in  Section  IV,  while  the  reduction 
in  the  optimal  miss  distance  due  to  the  penalization  is  estimated  in 
Section  V.  In  Section  VI  a  simple  recursive  algorithm  for  computing 
the  solution  of  the  modified  optimal  missile  avoidance  problem  is 
presented.  In  the  sequel  the  extension  of  the  solution  to  cases  of 
limited  missile  acceleration  and  target  roll  rate  is  discussed.  Solution 
of  the  linearized  modified  optimal  missile  avoidance  is  compared  to 
results  of  non-linear  simulation  in  Section  VII 


II.  PROBLEM  STATEMENT 


A.  Mathematical  Model 

The  goemetry  of  a  two-dimensional  pursuit-evasion  is  shown  in 
Fig.  1  defining  the  parameters  of  the  problem.  The  equations  of  motion 
of  optimal  missile  avoidance  can  be  written,  subject  to  the  set  of 
assumptions  outlined  in  the  Introduction,  as  follows: 

a.  Relative  geometry  perpendicular  to  the  line  of  sight 

y(t)  =  yT(t)  -  yM(t)  (1) 

and  consequently  the  relative  acceleration  is 

y(t)  =  yT(t)  -  yM(t)  (2) 


b.  Missile  guidance  transfer  function  is  expressed  by 


T  \  +  >"m  =  (Vc 


(3) 


c.  Missile  acceleration  command  is  obtained  according  to  the  guidance 


law  of  Proportional  Navigation 
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^m-’c  ”  (tf-t) 


(4) 


The  time  of  of  flight  of  the  missile  t^  is  fixed,  determined  by 


Target  acceleration  perpendicular  to  the  line  of  sight,  which  is 


the  control  function  of  the  problem,  is  bounded 
|yT(t)|  £  (aT)max 


(6) 


Introducing  nondimensional  variables  for  time  and  distance13  by 


t  =  t/r 


y  =  _ ' - 

T  ^T^max 


(7) 


leads  to  normalize  the  velocity  components  by  T(aT)max  and  accelerations 
by  (aj)max'  As  a  result  Eqs.(l),  (2),  and  (4)  are  transformed  to 


y(t)  =  yT(t)  -  yM(t) 

y +  = 


Z  ,  N' 

^MC  ~  ~ 


t  y 


N ' 

+  — — —  y 


(tf-t)‘  (tf-t) 


(8) 


with  the  constraint 


The  non-dimensional  state  vector  of  the  problem  can  thus  be  defined  as 


XT(t:)  =  [y(t)  ,y(t)  ,yM(t)] 


If  the  non-dimensional  control  vector  is  defined  as 


uT(t)  =  col[0,u,0]  =  col [0,yT(t) ,0] 


The  normalized  state  equation  can  be  written  as 


dX 

—  =  A  (t)  +  u 

dt 


with 


(10) 


(11) 


(12) 


(13) 


and  the  constraint 


B.  Formulation  of  the  Optimal  Control  Problem 

The  objective  of  the  missile  avoidance  is  to  maximize  the  survivability 
of  the  evading  aircraft.  Assuming  uniformly  performing  warhead  and 
proximity  fuse  leads  to  determine  the  payoff  as  the  square  of  the  miss 
distance.  For  normalized  parameters  and  linearized  kinematics  it  is 
expressed  as 

J  =  y2(tf)  =  m2  (15) 

The  optimal  missile  avoidance  with  the  above  described  mathematical 
model  can  be  formulated  as  a  fixed  duration  optimal  control  problem. 

This  problem  with  the  payoff  (15)  was  solved  in  a  previous  work,10 
yielding  a  non-singular  bang-bang  solution.  This  solution  predicts, 
for  long  normalized  times  of  flight,  long  initial  maneuvers .  Implementation 
of  such  a  maneuver  in  the  real  world  (described  by  non-linear  equations) 
results  in  significant  changes  in  interception  geometry,  and  may 
consequently  invalidate  the  assumptions  of  the  linearized  kinematic  model. 
It  has  been  also  shown10  that  the  sensitivity  of  the  miss-distance  to 
target  acceleration  performed  far  away  from  intercept  is  negligibly  small. 

In  order  to  avoid  excessively  long  and  inefficient  maneuvers  it  is 
proposed  to  modify  the  payoff  of  Eq.(15)  by  adding  a  term  of  a  target 
maneuver  effort  penalization.  Such  a  modified  payoff  for  the  non- 


dimensional  mathematical  model  has  the  form 
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The  value  of  the  weighting  coefficient  K  has  to  be  specified  later. 

The  modified  optimal  evasion  problem  can  be  thus  formulated: 

Given  the  dynamic  system  described  by  Eqs. (12)  with  zero  initial 
conditions  (X^  =  0)  and  unspecified  terminal  state ,  find,  for  a 
fixed  normalized  time  of  flight  t^  ,  the  optimal  control  u*(t)  subject 
to  the  constraint  (14),  which  maximizes  the  payoff  given  in  Eq.(16). 

The  payoff  function  in  the  form  of  Eq.(16)  was  used  in  the  past8 
to  avoid  numerical  difficulties.  Here  it  is  used  for  a  different 
purpose  and  it  will  be  shown  that  the  value  of  the  weighting  coefficient 
K  will  determine  the  domain  of  validity  of  the  linearized  kinematic 
model . 


III.  SOLUTION  OF  THE  MODIFIED  OPTIMAL  CONTROL  PROBLEM 

For  the  optimal  control  problem  formulated  in  the  previous  section, 
the  variational  Hamiltonian 

H (x.A.u.t)  =  -Ku2  +  X^[A(t)x  +  u]  (17) 

can  be  rewritten,  separating  the  part  independent  of  the  control  variable 
u,  as 


H  =  HQ(x,X_,t)  -  Ku2  +  X2u 


(18) 


r 


-  10  - 


The  components  of  the  costate  vector  X  are  determined  by  the  adjoint 


equation 


dX/dt  =  -3H/3x 


with  the  terminal  conditions 


X  L  (t  f )  =  -2x1(tf)  =  -2m 


W  =  VV  =  0 


For  a  linear  system  as  (12)  Eq. (19)  yields 


dX/dt  =  -AT  (t) X 


which  can  be  transformed  by  introducing  the  normalized  time-to-go 


0  =  tf  -  t 


dx/de  =  a  (0) x 


with  the  initial  conditions 


A  j  (0=0)  =  -2m 


x2(0)  =  X3(0)  =  0 


The  system  of  equations  (24)  can  be  reduced  to  a  scalar  differential 
equation  of  the  form 
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Up 
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+  N' 


(26) 


which  was  partially  solved  in  the  past.10  A  complete  close-form  solution 
for  integer  values  of  N',  is  presented  in  the  Appendix. 

The  optimal  control  u*  is  obtained  by 

it  =  arg  max  H  =  arg  max  (-Ku^  +  X  u)  (27 

|u|<i  2 


yielding 


u* (0)  =  sat 


X$(0) 


2K 


where 


X2(0)  =  -2m  f2(0) 


(28 


(29 


and  f 2 (63  is  given  by  F.q.(A-13) 


f  2  (0) 


N'-2 

e-0  l  (-l)"'-2-1 

i=0 


N'-2] 

■  i  >  eN'-l-i 


(N'-l-i)! 


(30 


The  saturation  function  is  defined  by 


sat (a) 


a  if  |  a  I  <  1 

sign(a)  if  |a|  *  1 


(31 


For  integer  values  of  N'  the  value  of  X^ft),  hence  the  value 
of  u*(t)  can  be  computed  in  a  closed-form  up  to  a  multiplicative 
constant.  It  can  be  easily  seen  that  as  K  approaches  zero,  u*(t) 
becomes  a  "bang-bang"  control.  The  normalized  form  of  X ^  is  depicted 
in  Fig.  2.  In  guidance  analysis  this  function  is  well  known  as  the 
miss  distance  sensitivity  function  to  a  unit  target  lateral  acceleration 
impulse.  The  exponential  form  of  Eq.(30)  leads  to  affirm,  as  it  can  be 
seen  also  in  Fig.  2,  that 

lim  X  (0)  =  0  (32) 

0  -*oo  L 

In  Figs.  3  and  4  the  optimal  control  function  u*(0)  is  depicted  for 
different  values  of  N'  and  K. 

From  these  figures  we  observe  that  the  optimal  control  solution, 
for  long  normalized  time  of  flight  (but  using  a  not  excessively  large 
weighting  coefficient  K)  consists  of  3  phases: 

1.  An  initial  phase  of  no-maneuver. 

2.  A  phase  of  gradually  increasing  maneuver. 

3.  An  almost  "bang-bang"  terminal  phase. 

Thus  excessively  long  initial  target  maneuvers  can  be  avoided  by 
a  proper  choice  of  K. 

The  miss  distance  using  non-zero  values  of  K  will  be  obviously 
smaller  than  the  miss  distance  with  K*0.  In  a  recent  study15  it  was 
shown  that  the  upper  bound  of  the  difference  in  the  miss  distances 
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can  be  estimated.  Such  an  estimate  will  be  made  in  the  sequel  after  the 
criteria  of  selecting  the  proper  value  of  the  penalization  coefficient 
K  is  discussed. 


IV.  DETERMINATION  OF  THE  WEIGHTING  COEFFICIENT  K. 

In  the  mathematical  model  used  in  this  work,  trajectory  linearization 
is  performed  assuming  that  both  vehicles  have  constant  speeds  and  that  they 
do  not  perform  excessive  turns.  The  first  assumption  appears  inherently 
in  the  Kqs.(4)  and  (5).  The  second  assumption,  however  has  to  be  verified 
using  the  optimal  control  function. 

In  this  section  a  method  is  presented  which  leads  to  determine  a 
proper  value  of  the  weighting  coefficient  K  of  the  control  effort 
penalization  in  Kq.(16),  such  that  the  validity  of  trajectory  lineariza¬ 
tion  is  guaranteed. 

The  "bang -bang"  structure  of  the  optimal  missile  avoidance 
maneu\ er  at  the  terminal  phase  leads  to  conclude  that  validity  of 
trajectory  linearization  can  be  achieved  by  avoiding  excessively  long 
initial  maneuvers  in  a  constant  direction. 

Let  us  define  as  tj  the  time  at  which  the  first  direction  change 
of  the  evader's  lateral  acceleration  occurs  (see  Figs.  3  and  4). 

The  validity  of  trajectory  linearization  can  be  preserved  by 


assuming  that  in  the  initial  phase  of  the  evasion  (t  <  t^)  the  direction 
change  of  the  target  is  bounded  by  some  limiting  value  (AT)  This 

limiting  value  has  to  be  empirically  specified. 

The  target's  turning  rate  is  given  by 


(33) 


assuming  constant  velocity,  the  target's  initial  direction  change, 
(Ayt)1  is  given  by 


(Ayt)  x 


aT(t)dt| 


(34) 


and  it  is  required  that 

<*Vl  *  (Ar)max 

Using  the  normalized  variables  of  Eq.(7)  and  noting  that 

u  4  -  aT 

«  ^aT^max  ^aT^max 


(35) 


(36) 


Eq .  (34)  can  be  rewritten  as 
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Defining  the  dynamic  similarity  parameter,  a  , 


(in  ;  id  i  j in.  ■  i.y 


~  A 
aT  - 


T(<Vmax 

V.„ 


(38) 


and  substituting  it  into  l.q.(37)  yields 

j1  -  -  .  r 

( A> .J. )  J  T  u(t)dt  =  '•XplAy.j.)  j  (3D) 

0 

where  (Ay  )  ,  the  initial  normalized  target  velocity  change  perpendicular 

to  the  initial  line  of  sight,  is  defined  by 


(A-Vi 


J  u(t)dt; 

0 


(10 ' 


In  I  ig.  S  (Ay.,.)  j  is  depicted  as  a  function  of  the  weighting  coefficient 
k  for  different  values  of  N' .  Substitution  of  (39)  into  (33)  leads  to 
I  ho  inequality 


(AT) 


(Avt),  * 


max 


aT 


(41 ) 


If  the  value  of  (AD  is  chosen  to  be  in  the  order  of  20-30  degrees 

max 

the  validity  of  the  linearization  will  not  be  violated.  Thus  for  any 
given  problem  the  proper  value  of  K  can  be  determined  using  (41)  and 


In  the  following  an  approximate  method  to  evaluate  the  required 


value  of  K  is  presented. 

For  a  given  (Ar)max  and  “t’  a  maximU111  admissible  value  of  (AvT)  j 
is  obtained  using  (41), 


max 


AF 


max 


(42) 


We  define  (see  Fig.  3) 


0 


1 


(43) 


and 


0 

u 


A 


+ 


max 


(44) 


Thus  a  unit  step  of  the  control,  applied  at  0^,  will  generate  (Ay-p  )max 
as  given  in  F.q  .  (42)  . 

Inspection  of  Figs.  3  and  4  indicates  that  the  actual  value  of  the 
optimal  control  function  at 


|u(0u)| =  0  <  1 


Assuming  a  parabolic  approximation  for  the  control  function  in  the 
unsaturated  phase  and  using  F.q.  (28)  and  (29)  we  obtain  (n  =  4/9) 
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iu(V 


W 


(46) 


yielding  an  approximation  for  K 


K  = 


(47) 


The  actual  value  of  n  ranges  from  0.25  to  0.5. 

Using  this  approximation  will  eventually  lead  to  a  value  of  (Ay  ) j  only 
slightly  different  than  the  limit  determined  in  Eq.(42). 


V.  EFFECT  OF  CONTROL -E1TOKT  PENALIZATION  ON  THE  MISS  DISTANCE 

The  use  of  control-effort  penalisation  term  in  the  modified  cost  J 
(Eq.16)  reduces  the  optimal  miss  distance  compared  to  the  one  obtained 
optimizing  the  original  cost  function  J  (Eq.15).  The  bounds  of  this 
difference  can  be  calculated  using  the  method  presented  in  a  previous  work .  lb 

We  denote  the  optimal  control  function  maximizing  J  by  u* ,  and 
the  resulting  miss  distance  by  m*,  wbi le  the  miss  distance  obtained  by 
maximizing  J  is  denoted  by  in*. 


Obviously  we  have 
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Since 


2  -  2 
m*  £  m*“ 


we  have 


0  £  m*‘ 


')  dt 


(49) 


(50) 


As  can  be  seen  from  Figs.  (3)  and  (4),  both  controls  are  almost  equal 
except  for  the  initial  unsaturated  part  of  u*.  Let  us  denote  by  t 
the  normalized  time  when  saturation  starts.  The  integral  in  the  right  side 
of  (50)  can  be  thus  decomposed 


u*2)dt  = 


t 


0 


? 

u*“)dt  + 


u*2)dt  (51) 


Since  the  second  integral  has  a  negligeable  small  value  (see  Figs.  3  and  41 
we  may  write  approximately 


■  -  2  2 
(u*  -  u  )dt 


(u 


-^2 


-  u’ 


')df 


(52) 


We  also  recall  that  the  optimal  control  of  the  original  problem  is  of 
"bang-bang"  type,  j.c., 

u*2  =  1  (53) 


Substituting  (52)  and  (53)  into  (50)  leads  to 


m*2  S  K 


(ts  -  V 


dt 


or 


_  ?  ^ 


0  i  m*~  -  111*“  *  K(t  -  t  )  Cl  -  u*“) 


where  u*  is  the  average  value  of  tlie  optimal  control  u*  in  the 


interval  tQ  »  t  v,  t  being  defined  by 


2  A  1 


%  '  to  t. 


u*“  dt  *  1 


Dividing  ):q.(55)  by  m* 


-,2  ,2  K(t  -  tj(l  -  u*“). 

m*  -  m*  s  0  A 


an  upper  bound  for  the  difference  between  the  normalized  miss  distance: 


Am*  -  (m*  -  m* )  i  0 


can  be  obtained  Assuming  that  c  «  1,  i.e., 
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m*  -  m*  =  (m*  +  m*)Am*  »  2m*  Am* 


-  20  - 

the  upper  bound  of  Am*  (for  t^  =  0)  becomes 

Kt  (1-u*2) 

Am*  *  — - -  (60) 

2m* 

A  rough  approximation  of  this  upper  bound  is  obtained  by  taking  the 
"worst  case",  i.e.,  t  =  t^.  and  u*  =  '  leading  to 

Ktf 

Am*  <  (61) 

2m* 

This  inequality  is  of  course  valid  only  if  the  product  Kt^.  is  sufficiently 
smal 1 . 

From  Table  1  it  can  be  clearly  seen  that  the  difference  Am*  in 
the  optimal  miss  distances  is  well  within  the  bounds  predicted  by  Eq.(60) 
and  (61) . 
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Actual  Value 

Predicted 

upper  bounds 

N' 

m* 

K 

Am* 

(%) 

Eq. (60) 

Eq . (61 ) 

in* 

10"4 

1 .03xl0~ 4 

0.02 

1 .53*10'3 

2.8*103 

3 

0.541 

10'2 

1.4  *10'2 

2.58 

0.21 

0.28 

10'4 

2.9  *10~4 

0.1 

2 . 53*10' 3 

5 . 1*10~  3 

5 

0.294 

10'3 

5.4  *10' 3 

1.83 

3.49*10-2 

5. 1*102 

TABLE  1.  Actual  loss  in  the  optimal  miss  distance  and  its  predicted 
upper  bounds  (t^.  =  30) . 


VI 


RECURSIVE  ALGORITHM  FOR  NUMERICAL  SOLUTION 


A.  Linear  Model 

The  state  equation  (12)  yields  a  formal  solution  of  the  form 


X(t)  =  4-(t,t0)X(0) 


^(t,€)u(5)d? 


(62) 


where  T(t,t0)  is  the  state  transition  matrix  of  the  homogeneous  time- 
varying  linear  system 


X  =  A(t) X 


(63) 


This  solution  can  be  expressed  in  terms  of  confluent  hypergeometric 
functions,  which  are  not  suitable  for  explicit  analysis.  Therefore,  a 
complete  solution  of  the  optimal  problem  requires  some  numerical  aid. 
The  existence  of  explicit,  time-dependent  expression  for  f  (?)  which 
serves  as  a  switch  function  of  the  "bang-bang"  solution  in  the  original 
problem  •  (K  =  0)  provides  a  handy  algorithm  for  the  solution  of  the 
modified  optimal  evasion. 

The  procedure  is  the  following: 

1.  Known  the  function  f?(t)  from  Eqs.  '  )  :u  d  solve  Eq  .(12)  and 

compute  the  miss  distance  m*,  obtained  by  "bang-bang"  control 
u*  =  sign[f .,(?)] .  Pose 


m. 

1 


1 


(04) 


-  )»*  i  = 


where  i  is  the  iteration  index. 


2.  Solve  Kq .(12)  with  the  new  control 


u 


i+  1 


fill. 

sat  j— ■  f7(t) 

i  N  - 


and  compute  the  new  miss  distance 


"i  +  1 


(65) 


3.  Advance  the  index  i  by  a  unit  and  repeat  step  2  until  a  required 
convergence  criterion  for  m*  is  satisfied. 


This  algorithm  has  a  very  fast  convergence  for  values  of  1C  <  10 
I n  most  of  the  practical  cases,  1  to  3  iterations  are  needed  to  obtain  a 
relative  error  of  0.1®»  in  the  miss  distance. 


R  F.xtension  to  Other  Cases 

In  a  realistic  description  of  the  problem  of  optimal  missile  avoidance 
the  assumptions  of  infinite  missile  maneuverability  and  instantaneous  target 
response  (implying  an  infinite  roll-rate),  enabling  to  solve  a  linear 
problem,  have  to  be  abandoned.  In  previous  works10’11  it  was  shown  that 
taking  into  account  the  limits  of  missile  lateral  acceleration  and  target 
roll-rate  does  not  change  the  "bang-bang"  structure  of  the  optimal  control 
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solution.  However,  due  to  the  non-linear  effects  the  solution  has  to  be 
obtained  numerically.  Assume  that  the  original  solution  for  K=0  is 
obtained  and  it  seems  to  generate  large  variation  of  the  interception 
geometry,  which  invalidate  trajectory  linearization.  (This  solution 
includes  the  optimal  switch  function  A^(f)  obtained  by  some  numerical 
method).  The  appropriate  value  of  K  >  0  that  guarantees  validity  of 
linearized  kinematics  can  be  estimated  using  the  same  equation  (47)  as  in 
the  presented  case,  giving  to  f0(t)  the  correct  interpretation. 

In  order  to  obtain  the  optimal  solution  of  the  modified  missile 
avoidance  problem  (K  >  0)  the  recursive  algorithm  presented  in  the  previous 
subsection  can  he  used.  The  only  difference  is  that  for  each  iteration  a 
new  switch  function  has  to  be  computed  numerically. 


V IJ _ COMPARISON  TO  NON -MNl'.AR  SIMULATION 

Solutions  of  modified  optimal  missile  avoidance  problems  with 
linearized  kinematics  were  compared  to  results  of  non-linear  simulations. 
In  the  simulation  the  optimal  control  functions  obtained  by  the  linearized 
model  were  used.  The  comparison  was  carried  out  for  several  values  of  N', 
K  and  t^  The  constant  parameters  used  for  the  comparison  were: 

t  =  0.35  sec 


710  m/sec 


25 


(a_)  =  50  m/sec2 

T  max 

=  250  m/sec 

Thus  the  value  of  the  dynamic  similarity  parameter,13  alj,,  defined  in 
Eq.(38),  is  =  0.07  rad. 

In  Table  2  results  of  the  comparison  are  presented  for  t^.  =  30,  i.e., 
predicted  flight  time  of  t^.  =  10.5  sec  for  the  linear  model.  In  the 
Table,  T^  denotes  the  actual  time  of  flight  and  M*  is  the  normalized 
miss  distance  obtained  by  the  simulation. 

From  the  results  of  Table  2  the  following  conclusions  can  be  drawn: 

a.  For  small  (or  zero)  values  of  the  penalization  coefficient  K  which 
allow  large  changes  in  the  geometry  (Ay^  >  30°)  the  miss  distance 
obtained  by  the  simulation  is  smaller  than  the  linearized  prediction. 
Moreover,  the  actual  time  of  flight  T^.  also  differs  from  the  predicted 
value  t^.  Both  observations  indicate  that  assumption  of  trajectory 
linearization  is  not  valid  in  these  cases. 

b.  If  the  direction  change  of  the  target  Ay^  is  kept  below  30°  the 
miss  distance  obtained  by  non-linear  simulation  is  slightly  larger  than 
the  value  predicted  by  the  linearized  model. 

c.  If  Ay^  <  30°  the  actual  time  of  flight  is  well  approximated  by  the 
linearized  prediction  and  the  resulting  miss  distance  is  even  larger  than 
the  value  predicted  by  the  linear  model  with  K=0. 


Simulation 


N' 

K  1 

1 

m* 

M* 

AYT(deg) 

Tf  (sec) 

0 

0.541 

0.138 

95.4 

9.08 

3 

10*  4 

0.541 

0.496 

46.0 

10.33 

10'2 

0.537 

0.551 

23.9 

10.47 

0 

0.294 

0.177 

73.3 

9.41 

5 

10"4 

0.294 

0.300 

30.3 

10.41 

10'2 

0.243 

0.256 

1.1 

-  ■  — 

10.50 

TABLE  2.  Comparison  of  results  of  the  linearized  model  to  non-linear 
simulation . 


VJII.  CONCIDSIONS 

The  work  presented  in  this  paper  showed  that  the  validity  of  linoari 
kincmatical  model  used  in  the  analysis  of  optimal  missile  avoidance  0  ’  1  c;. 
be  largely  extended  hy  adding  a  control  effort  penalization  term  to  the 
original  pay-off  function.  Both  linear  prediction  and  simulation  indicate 
that  the  loss  of  miss  distance  due  to  the  control  effort  penalization  is 
of  no  practical  significance.  In  the  present  paper  a  2-h  case  was 
analysed  merely  for  sake  of  simplicity.  ITie  method  is  equally  applicable 
for  3-0  optimal  missile  avoidance. 


i 
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CLOSED  FORM  SOLUTION  OF  THE  COSTATE  VECTOR  FOR  LINEAR  FIRST-ORPER  DYNAMICS 

The  costate  equations  of  the  problem  are  given  by  Hq.(24) 

dA/dO  =  a"‘  (6)  A  (a-  1 ) 

with  the  initi;,]  conditions  of  Hq.(25). 
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A  1 

I 


According  to  Leibnitz  rule  for  the  derivative  of  a  product' 


=  i„(0[^f(e,]f9W 


where  the  binomial  coefficients  (.  )  are  defined  by 


(A-6) 


(A  _ 

W  i ! Cj-i) ! 


the  i-th  derivative  of  the  expression 


f(0)  =  e' 


(A-8 1 


yields 


d  -0  ,  .  i  -6 

— r  e  =  (-1)  e 

d0 


(A-9) 


and  the  i-th  derivative  of  the  expression 


g(6)  «  Jr 


(A- 1 0) 
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yields,  provided  i  ^ 

d1  e*  0jui  e1"1 

d0i  If  n  '  (1-i)! 

Substituting  (A-9)  and  (A-ll)  in  (A-6)  we  obtain 


(A- 1 1 ) 


(s+1) 


I+T 


1  dj  f  -e  6Z)  -0 

r^TVe  rry  =  e 


j 

[  (-n*-1 

i=0 


(0 

T^TIT 


(A- 12) 


Equations  (A-3)  and  (A-4)  are  solved  by  direct  application  of  Eq. (A-12) 
yielding 


N'-2 


/N ' -2  ^ 

,  rAN  _  -0  Y  ,  ,  -.N '  -2-i  V  i  J 

A2(6)  ~c  e  L  (N'-l-i)! 


N'-l-i 


i=0 


CA- 13) 


N'  -2 


A3(0) 


-  C 


e'0  l  (-1) 


/N' -2^ 

N*-2-i  V  i  J  nN'-j 
(N'-i)! 


i  =  0 


(A- 14) 


A ^ ( 0 )  in  Eq.(A-2)  can  be  obtained  by  using  Eq.(A-12)  in  the  expression 


(A- 


A  (6)  =  L~  tsA  (s)  ]  =  L  -c  — - rrr 

*  L  (s+1)  J 


N'-l 


=  -c  e 


j  r_ ! )N '  - 1  -i  — ^ — i_jL  0N'"1_i 

L  y  J  (N'-l-i)! 


i=0 


by  substituting  Eqs.(A-15)  and  (A-14)  into  Eq.(A-2)  and  rearranging,  we 
obtain 


Aj(0)  = 


-c  e 


N'-l 

I 

i  =  l 


(-1) 


N'-l-i 


(N ' -2) 


aN'-l-i 


(i-l).'(N'-i)!  (N'-l-i).1 


(A- 


The  initial  conditions  of  Eq. (A-l)  are  satisfied  by  Eqs.(A-13),  (A-14) 
and  (A-16).  The  constant  c  is  determined  by  Eqs.(A-16)  and  (20) 


Aj (0)  =  -c  =  -2y(tf)  =  -2m 


l-i 

£ 


yielding 


c  =  ..m 


(A- 17) 


\i 
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Normalized  miss-distance  sensitivity  function. 


Fig.  4.  Optimal  avoidance  control  for  different  values  of  penalization 
coefficient  (N*  =  5). 


